Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 54000 |
| Missing cells | 29 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 6.2 MiB |
| Average record size in memory | 120.0 B |
Variable types
| Text | 2 |
|---|---|
| DateTime | 2 |
| Numeric | 7 |
| Categorical | 4 |
DaysWorkedPerWeek is highly overall correlated with PartTimeFullTime | High correlation |
InitialIncurredClaimsCost is highly overall correlated with UltimateIncurredClaimCost | High correlation |
PartTimeFullTime is highly overall correlated with DaysWorkedPerWeek | High correlation |
UltimateIncurredClaimCost is highly overall correlated with InitialIncurredClaimsCost | High correlation |
Gender is highly imbalanced (51.0%) | Imbalance |
DependentsOther is highly imbalanced (96.6%) | Imbalance |
PartTimeFullTime is highly imbalanced (56.2%) | Imbalance |
HoursWorkedPerWeek is highly skewed (γ1 = 24.13297421) | Skewed |
InitialIncurredClaimsCost is highly skewed (γ1 = 26.85365748) | Skewed |
UltimateIncurredClaimCost is highly skewed (γ1 = 37.55250381) | Skewed |
ClaimNumber has unique values | Unique |
DependentChildren has 50639 (93.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-08-13 15:44:48.516469 |
|---|---|
| Analysis finished | 2024-08-13 15:45:01.248363 |
| Duration | 12.73 seconds |
| Software version | ydata-profiling vv4.9.0 |
| Download configuration | config.json |
ClaimNumber
Text
UNIQUE 
| Distinct | 54000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Characters and Unicode
| Total characters | 486000 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 54000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | WC8285054 |
|---|---|
| 2nd row | WC6982224 |
| 3rd row | WC5481426 |
| 4th row | WC9775968 |
| 5th row | WC2634037 |
| Value | Count | Frequency (%) |
| wc8285054 | 1 | < 0.1% |
| wc3735596 | 1 | < 0.1% |
| wc6049270 | 1 | < 0.1% |
| wc8595173 | 1 | < 0.1% |
| wc2826510 | 1 | < 0.1% |
| wc5481426 | 1 | < 0.1% |
| wc9775968 | 1 | < 0.1% |
| wc2634037 | 1 | < 0.1% |
| wc6828422 | 1 | < 0.1% |
| wc8058150 | 1 | < 0.1% |
| Other values (53990) | 53990 |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 54000 | |
| C | 54000 | |
| 9 | 39592 | |
| 7 | 39516 | |
| 8 | 39067 | |
| 6 | 38929 | |
| 3 | 38648 | |
| 5 | 38563 | |
| 4 | 38214 | |
| 2 | 38174 | |
| Other values (2) | 67297 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 486000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| W | 54000 | |
| C | 54000 | |
| 9 | 39592 | |
| 7 | 39516 | |
| 8 | 39067 | |
| 6 | 38929 | |
| 3 | 38648 | |
| 5 | 38563 | |
| 4 | 38214 | |
| 2 | 38174 | |
| Other values (2) | 67297 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 486000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| W | 54000 | |
| C | 54000 | |
| 9 | 39592 | |
| 7 | 39516 | |
| 8 | 39067 | |
| 6 | 38929 | |
| 3 | 38648 | |
| 5 | 38563 | |
| 4 | 38214 | |
| 2 | 38174 | |
| Other values (2) | 67297 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 486000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| W | 54000 | |
| C | 54000 | |
| 9 | 39592 | |
| 7 | 39516 | |
| 8 | 39067 | |
| 6 | 38929 | |
| 3 | 38648 | |
| 5 | 38563 | |
| 4 | 38214 | |
| 2 | 38174 | |
| Other values (2) | 67297 |
| Distinct | 36673 |
|---|---|
| Distinct (%) | 67.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
| Minimum | 1988-01-01 09:00:00+00:00 |
|---|---|
| Maximum | 2005-12-31 10:00:00+00:00 |
DateReported
Date
| Distinct | 6653 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
| Minimum | 1988-01-08 00:00:00+00:00 |
|---|---|
| Maximum | 2006-09-23 00:00:00+00:00 |
Age
Real number (ℝ)
| Distinct | 68 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.84237 |
| Minimum | 13 |
|---|---|
| Maximum | 81 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 18 |
| Q1 | 23 |
| median | 32 |
| Q3 | 43 |
| 95-th percentile | 56 |
| Maximum | 81 |
| Range | 68 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 12.122165 |
|---|---|
| Coefficient of variation (CV) | 0.3581949 |
| Kurtosis | -0.60607452 |
| Mean | 33.84237 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.53634113 |
| Sum | 1827488 |
| Variance | 146.94687 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 2108 | 3.9% |
| 21 | 2023 | 3.7% |
| 20 | 1987 | 3.7% |
| 22 | 1976 | 3.7% |
| 23 | 1962 | 3.6% |
| 25 | 1758 | 3.3% |
| 24 | 1756 | 3.3% |
| 18 | 1736 | 3.2% |
| 27 | 1660 | 3.1% |
| 28 | 1625 | 3.0% |
| Other values (58) | 35409 |
| Value | Count | Frequency (%) |
| 13 | 9 | < 0.1% |
| 14 | 34 | 0.1% |
| 15 | 158 | 0.3% |
| 16 | 519 | 1.0% |
| 17 | 1058 | |
| 18 | 1736 | |
| 19 | 2108 | |
| 20 | 1987 | |
| 21 | 2023 | |
| 22 | 1976 |
| Value | Count | Frequency (%) |
| 81 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 79 | 2 | < 0.1% |
| 78 | 2 | < 0.1% |
| 76 | 4 | < 0.1% |
| 75 | 5 | |
| 74 | 7 | |
| 73 | 1 | < 0.1% |
| 72 | 11 | |
| 71 | 7 |
Gender
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
| M | |
|---|---|
| F | |
| U | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 54000 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| M | 41660 | |
| F | 12338 | 22.8% |
| U | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| m | 41660 | |
| f | 12338 | 22.8% |
| u | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 41660 | |
| F | 12338 | 22.8% |
| U | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| M | 41660 | |
| F | 12338 | 22.8% |
| U | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| M | 41660 | |
| F | 12338 | 22.8% |
| U | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| M | 41660 | |
| F | 12338 | 22.8% |
| U | 2 | < 0.1% |
MaritalStatus
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 29 |
| Missing (%) | 0.1% |
| Memory size | 422.0 KiB |
| S | |
|---|---|
| M | |
| U |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 53971 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | U |
| 4th row | S |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| S | 26161 | |
| M | 22516 | |
| U | 5294 | 9.8% |
| (Missing) | 29 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| s | 26161 | |
| m | 22516 | |
| u | 5294 | 9.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 26161 | |
| M | 22516 | |
| U | 5294 | 9.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 53971 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| S | 26161 | |
| M | 22516 | |
| U | 5294 | 9.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 53971 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| S | 26161 | |
| M | 22516 | |
| U | 5294 | 9.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 53971 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| S | 26161 | |
| M | 22516 | |
| U | 5294 | 9.8% |
DependentChildren
Real number (ℝ)
ZEROS 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.11918519 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 50639 |
| Zeros (%) | 93.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.51778002 |
|---|---|
| Coefficient of variation (CV) | 4.3443321 |
| Kurtosis | 30.006216 |
| Mean | 0.11918519 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.1123793 |
| Sum | 6436 |
| Variance | 0.26809615 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 50639 | |
| 2 | 1361 | 2.5% |
| 1 | 1273 | 2.4% |
| 3 | 528 | 1.0% |
| 4 | 150 | 0.3% |
| 5 | 42 | 0.1% |
| 6 | 5 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 50639 | |
| 1 | 1273 | 2.4% |
| 2 | 1361 | 2.5% |
| 3 | 528 | 1.0% |
| 4 | 150 | 0.3% |
| 5 | 42 | 0.1% |
| 6 | 5 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 42 | 0.1% |
| 4 | 150 | 0.3% |
| 3 | 528 | 1.0% |
| 2 | 1361 | 2.5% |
| 1 | 1273 | 2.4% |
| 0 | 50639 |
DependentsOther
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
| 0 | |
|---|---|
| 1 | 462 |
| 2 | 23 |
| 3 | 8 |
| 5 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 54000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 53506 | |
| 1 | 462 | 0.9% |
| 2 | 23 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 53506 | |
| 1 | 462 | 0.9% |
| 2 | 23 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 53506 | |
| 1 | 462 | 0.9% |
| 2 | 23 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 53506 | |
| 1 | 462 | 0.9% |
| 2 | 23 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 53506 | |
| 1 | 462 | 0.9% |
| 2 | 23 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 53506 | |
| 1 | 462 | 0.9% |
| 2 | 23 | < 0.1% |
| 3 | 8 | < 0.1% |
| 5 | 1 | < 0.1% |
WeeklyWages
Real number (ℝ)
| Distinct | 13211 |
|---|---|
| Distinct (%) | 24.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 416.36481 |
| Minimum | 1 |
|---|---|
| Maximum | 7497 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 200 |
| Q1 | 200 |
| median | 392.2 |
| Q3 | 500 |
| 95-th percentile | 817.0055 |
| Maximum | 7497 |
| Range | 7496 |
| Interquartile range (IQR) | 300 |
Descriptive statistics
| Standard deviation | 248.63867 |
|---|---|
| Coefficient of variation (CV) | 0.59716543 |
| Kurtosis | 68.023352 |
| Mean | 416.36481 |
| Median Absolute Deviation (MAD) | 152.2 |
| Skewness | 4.1227669 |
| Sum | 22483700 |
| Variance | 61821.188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 200 | 12372 | 22.9% |
| 500 | 4271 | 7.9% |
| 300 | 570 | 1.1% |
| 400 | 389 | 0.7% |
| 350 | 336 | 0.6% |
| 600 | 294 | 0.5% |
| 450 | 193 | 0.4% |
| 250 | 186 | 0.3% |
| 289.93 | 145 | 0.3% |
| 480 | 127 | 0.2% |
| Other values (13201) | 35117 |
| Value | Count | Frequency (%) |
| 1 | 122 | |
| 1.91 | 1 | < 0.1% |
| 3.59 | 1 | < 0.1% |
| 3.95 | 2 | < 0.1% |
| 4.61 | 1 | < 0.1% |
| 4.73 | 2 | < 0.1% |
| 5 | 16 | < 0.1% |
| 5.25 | 2 | < 0.1% |
| 5.49 | 2 | < 0.1% |
| 5.78 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7497 | 3 | |
| 7400 | 1 | < 0.1% |
| 6453 | 1 | < 0.1% |
| 4556 | 2 | |
| 4311.3 | 1 | < 0.1% |
| 3750 | 4 | |
| 3500 | 2 | |
| 2956.52 | 2 | |
| 2817.92 | 1 | < 0.1% |
| 2766.04 | 2 |
PartTimeFullTime
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
| F | |
|---|---|
| P | 4888 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 54000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | F |
| 4th row | F |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 49112 | |
| P | 4888 | 9.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 49112 | |
| p | 4888 | 9.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 49112 | |
| P | 4888 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| F | 49112 | |
| P | 4888 | 9.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| F | 49112 | |
| P | 4888 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 54000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| F | 49112 | |
| P | 4888 | 9.1% |
HoursWorkedPerWeek
Real number (ℝ)
SKEWED 
| Distinct | 424 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.735084 |
| Minimum | 0 |
|---|---|
| Maximum | 640 |
| Zeros | 29 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 22.3815 |
| Q1 | 38 |
| median | 38 |
| Q3 | 40 |
| 95-th percentile | 40 |
| Maximum | 640 |
| Range | 640 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 12.568704 |
|---|---|
| Coefficient of variation (CV) | 0.3330774 |
| Kurtosis | 910.21194 |
| Mean | 37.735084 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 24.132974 |
| Sum | 2037694.6 |
| Variance | 157.97231 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 38 | 30829 | |
| 40 | 13283 | |
| 20 | 894 | 1.7% |
| 30 | 837 | 1.6% |
| 35 | 743 | 1.4% |
| 37.5 | 663 | 1.2% |
| 25 | 414 | 0.8% |
| 50 | 322 | 0.6% |
| 15 | 301 | 0.6% |
| 45 | 284 | 0.5% |
| Other values (414) | 5430 | 10.1% |
| Value | Count | Frequency (%) |
| 0 | 29 | |
| 1 | 31 | |
| 2 | 6 | < 0.1% |
| 2.1 | 1 | < 0.1% |
| 3 | 26 | |
| 3.5 | 5 | < 0.1% |
| 4 | 34 | |
| 4.1 | 1 | < 0.1% |
| 4.5 | 4 | < 0.1% |
| 5 | 50 |
| Value | Count | Frequency (%) |
| 640 | 1 | < 0.1% |
| 638 | 2 | |
| 627 | 2 | |
| 538.3 | 2 | |
| 462.08 | 2 | |
| 450 | 3 | |
| 417.2 | 1 | < 0.1% |
| 410 | 2 | |
| 400 | 3 | |
| 389 | 3 |
DaysWorkedPerWeek
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9057593 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.55212911 |
|---|---|
| Coefficient of variation (CV) | 0.11254713 |
| Kurtosis | 18.240675 |
| Mean | 4.9057593 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -3.3404679 |
| Sum | 264911 |
| Variance | 0.30484655 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 49185 | |
| 4 | 1476 | 2.7% |
| 3 | 1436 | 2.7% |
| 6 | 884 | 1.6% |
| 2 | 513 | 0.9% |
| 7 | 323 | 0.6% |
| 1 | 183 | 0.3% |
| Value | Count | Frequency (%) |
| 1 | 183 | 0.3% |
| 2 | 513 | 0.9% |
| 3 | 1436 | 2.7% |
| 4 | 1476 | 2.7% |
| 5 | 49185 | |
| 6 | 884 | 1.6% |
| 7 | 323 | 0.6% |
| Value | Count | Frequency (%) |
| 7 | 323 | 0.6% |
| 6 | 884 | 1.6% |
| 5 | 49185 | |
| 4 | 1476 | 2.7% |
| 3 | 1436 | 2.7% |
| 2 | 513 | 0.9% |
| 1 | 183 | 0.3% |
ClaimDescription
Text
| Distinct | 28114 |
|---|---|
| Distinct (%) | 52.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 422.0 KiB |
Length
| Max length | 94 |
|---|---|
| Median length | 74 |
| Mean length | 43.453704 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2346500 |
|---|---|
| Distinct characters | 29 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 21436 ? |
|---|---|
| Unique (%) | 39.7% |
Sample
| 1st row | LIFTING TYRE INJURY TO RIGHT ARM AND WRIST INJURY |
|---|---|
| 2nd row | STEPPED AROUND CRATES AND TRUCK TRAY FRACTURE LEFT FOREARM |
| 3rd row | CUT ON SHARP EDGE CUT LEFT THUMB |
| 4th row | DIGGING LOWER BACK LOWER BACK STRAIN |
| 5th row | REACHING ABOVE SHOULDER LEVEL ACUTE MUSCLE STRAIN LEFT SIDE OF STOMACH |
| Value | Count | Frequency (%) |
| right | 22648 | 6.0% |
| left | 20756 | 5.5% |
| back | 16346 | 4.3% |
| strain | 15259 | 4.0% |
| lower | 9950 | 2.6% |
| and | 9103 | 2.4% |
| finger | 8584 | 2.3% |
| lifting | 8300 | 2.2% |
| hand | 7723 | 2.0% |
| struck | 7354 | 1.9% |
| Other values (3718) | 253028 |
Most occurring characters
| Value | Count | Frequency (%) |
| 325051 | ||
| E | 216976 | 9.2% |
| I | 176650 | 7.5% |
| T | 176030 | 7.5% |
| R | 171992 | 7.3% |
| N | 153319 | 6.5% |
| A | 138288 | 5.9% |
| L | 127791 | 5.4% |
| S | 99153 | 4.2% |
| O | 94967 | 4.0% |
| Other values (19) | 666283 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2346500 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 325051 | ||
| E | 216976 | 9.2% |
| I | 176650 | 7.5% |
| T | 176030 | 7.5% |
| R | 171992 | 7.3% |
| N | 153319 | 6.5% |
| A | 138288 | 5.9% |
| L | 127791 | 5.4% |
| S | 99153 | 4.2% |
| O | 94967 | 4.0% |
| Other values (19) | 666283 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2346500 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 325051 | ||
| E | 216976 | 9.2% |
| I | 176650 | 7.5% |
| T | 176030 | 7.5% |
| R | 171992 | 7.3% |
| N | 153319 | 6.5% |
| A | 138288 | 5.9% |
| L | 127791 | 5.4% |
| S | 99153 | 4.2% |
| O | 94967 | 4.0% |
| Other values (19) | 666283 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2346500 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 325051 | ||
| E | 216976 | 9.2% |
| I | 176650 | 7.5% |
| T | 176030 | 7.5% |
| R | 171992 | 7.3% |
| N | 153319 | 6.5% |
| A | 138288 | 5.9% |
| L | 127791 | 5.4% |
| S | 99153 | 4.2% |
| O | 94967 | 4.0% |
| Other values (19) | 666283 |
InitialIncurredClaimsCost
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 1989 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7841.146 |
| Minimum | 1 |
|---|---|
| Maximum | 2000000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 315 |
| Q1 | 700 |
| median | 2000 |
| Q3 | 9500 |
| 95-th percentile | 30000 |
| Maximum | 2000000 |
| Range | 1999999 |
| Interquartile range (IQR) | 8800 |
Descriptive statistics
| Standard deviation | 20584.075 |
|---|---|
| Coefficient of variation (CV) | 2.625136 |
| Kurtosis | 1888.277 |
| Mean | 7841.146 |
| Median Absolute Deviation (MAD) | 1500 |
| Skewness | 26.853657 |
| Sum | 4.2342188 × 108 |
| Variance | 4.2370414 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 6260 | 11.6% |
| 1000 | 4587 | 8.5% |
| 10000 | 3453 | 6.4% |
| 3500 | 2600 | 4.8% |
| 7500 | 2507 | 4.6% |
| 1500 | 2329 | 4.3% |
| 2000 | 1607 | 3.0% |
| 9500 | 1255 | 2.3% |
| 5000 | 928 | 1.7% |
| 25000 | 831 | 1.5% |
| Other values (1979) | 27643 |
| Value | Count | Frequency (%) |
| 1 | 46 | |
| 9 | 2 | < 0.1% |
| 10 | 3 | < 0.1% |
| 30 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 50 | 8 | < 0.1% |
| 55 | 1 | < 0.1% |
| 60 | 4 | < 0.1% |
| 70 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 2000000 | 1 | |
| 872980 | 1 | |
| 830000 | 2 | |
| 725000 | 1 | |
| 690000 | 1 | |
| 540000 | 1 | |
| 500000 | 1 | |
| 450000 | 1 | |
| 425000 | 2 | |
| 421000 | 1 |
UltimateIncurredClaimCost
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 53999 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11003.369 |
| Minimum | 121.88681 |
|---|---|
| Maximum | 4027135.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 422.0 KiB |
Quantile statistics
| Minimum | 121.88681 |
|---|---|
| 5-th percentile | 306.55693 |
| Q1 | 926.33845 |
| median | 3371.2417 |
| Q3 | 8197.2486 |
| 95-th percentile | 45224.184 |
| Maximum | 4027135.9 |
| Range | 4027014 |
| Interquartile range (IQR) | 7270.9102 |
Descriptive statistics
| Standard deviation | 33390.991 |
|---|---|
| Coefficient of variation (CV) | 3.0346152 |
| Kurtosis | 3940.8638 |
| Mean | 11003.369 |
| Median Absolute Deviation (MAD) | 2786.0995 |
| Skewness | 37.552504 |
| Sum | 5.9418194 × 108 |
| Variance | 1.1149583 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1092.841276 | 2 | < 0.1% |
| 4748.203388 | 1 | < 0.1% |
| 3979.777262 | 1 | < 0.1% |
| 29888.15421 | 1 | < 0.1% |
| 752.4568306 | 1 | < 0.1% |
| 285.4824447 | 1 | < 0.1% |
| 2033.28861 | 1 | < 0.1% |
| 18299.90692 | 1 | < 0.1% |
| 1839.359986 | 1 | < 0.1% |
| 6285.121747 | 1 | < 0.1% |
| Other values (53989) | 53989 |
| Value | Count | Frequency (%) |
| 121.8868054 | 1 | |
| 123.1648797 | 1 | |
| 124.5796609 | 1 | |
| 129.1061303 | 1 | |
| 131.457013 | 1 | |
| 132.9995569 | 1 | |
| 134.3203499 | 1 | |
| 138.4751978 | 1 | |
| 139.8539663 | 1 | |
| 140.1828009 | 1 |
| Value | Count | Frequency (%) |
| 4027135.935 | 1 | |
| 865770.6486 | 1 | |
| 823706.3012 | 1 | |
| 768485.1182 | 1 | |
| 742003.2335 | 1 | |
| 741498.0275 | 1 | |
| 713784.0636 | 1 | |
| 608650.4259 | 1 | |
| 586912.8191 | 1 | |
| 558408.9616 | 1 |
| Age | DaysWorkedPerWeek | DependentChildren | DependentsOther | Gender | HoursWorkedPerWeek | InitialIncurredClaimsCost | MaritalStatus | PartTimeFullTime | UltimateIncurredClaimCost | WeeklyWages | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Age | 1.000 | 0.032 | 0.097 | 0.024 | 0.065 | 0.027 | 0.196 | 0.405 | 0.090 | 0.234 | 0.210 |
| DaysWorkedPerWeek | 0.032 | 1.000 | 0.019 | 0.008 | 0.141 | 0.429 | -0.002 | 0.033 | 0.712 | -0.005 | 0.172 |
| DependentChildren | 0.097 | 0.019 | 1.000 | 0.119 | 0.001 | 0.056 | 0.037 | 0.161 | 0.019 | 0.057 | 0.109 |
| DependentsOther | 0.024 | 0.008 | 0.119 | 1.000 | 0.010 | 0.000 | 0.024 | 0.057 | 0.000 | 0.000 | 0.023 |
| Gender | 0.065 | 0.141 | 0.001 | 0.010 | 1.000 | 0.010 | 0.000 | 0.023 | 0.249 | 0.000 | 0.051 |
| HoursWorkedPerWeek | 0.027 | 0.429 | 0.056 | 0.000 | 0.010 | 1.000 | 0.014 | 0.005 | 0.014 | 0.026 | 0.287 |
| InitialIncurredClaimsCost | 0.196 | -0.002 | 0.037 | 0.024 | 0.000 | 0.014 | 1.000 | 0.009 | 0.001 | 0.883 | 0.301 |
| MaritalStatus | 0.405 | 0.033 | 0.161 | 0.057 | 0.023 | 0.005 | 0.009 | 1.000 | 0.022 | 0.000 | 0.053 |
| PartTimeFullTime | 0.090 | 0.712 | 0.019 | 0.000 | 0.249 | 0.014 | 0.001 | 0.022 | 1.000 | 0.000 | 0.064 |
| UltimateIncurredClaimCost | 0.234 | -0.005 | 0.057 | 0.000 | 0.000 | 0.026 | 0.883 | 0.000 | 0.000 | 1.000 | 0.350 |
| WeeklyWages | 0.210 | 0.172 | 0.109 | 0.023 | 0.051 | 0.287 | 0.301 | 0.053 | 0.064 | 0.350 | 1.000 |
| ClaimNumber | DateTimeOfAccident | DateReported | Age | Gender | MaritalStatus | DependentChildren | DependentsOther | WeeklyWages | PartTimeFullTime | HoursWorkedPerWeek | DaysWorkedPerWeek | ClaimDescription | InitialIncurredClaimsCost | UltimateIncurredClaimCost | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | WC8285054 | 2002-04-09T07:00:00Z | 2002-07-05T00:00:00Z | 48 | M | M | 0 | 0 | 500.00 | F | 38.0 | 5 | LIFTING TYRE INJURY TO RIGHT ARM AND WRIST INJURY | 1500 | 4748.203388 |
| 1 | WC6982224 | 1999-01-07T11:00:00Z | 1999-01-20T00:00:00Z | 43 | F | M | 0 | 0 | 509.34 | F | 37.5 | 5 | STEPPED AROUND CRATES AND TRUCK TRAY FRACTURE LEFT FOREARM | 5500 | 6326.285819 |
| 2 | WC5481426 | 1996-03-25T00:00:00Z | 1996-04-14T00:00:00Z | 30 | M | U | 0 | 0 | 709.10 | F | 38.0 | 5 | CUT ON SHARP EDGE CUT LEFT THUMB | 1700 | 2293.949087 |
| 3 | WC9775968 | 2005-06-22T13:00:00Z | 2005-07-22T00:00:00Z | 41 | M | S | 0 | 0 | 555.46 | F | 38.0 | 5 | DIGGING LOWER BACK LOWER BACK STRAIN | 15000 | 17786.487170 |
| 4 | WC2634037 | 1990-08-29T08:00:00Z | 1990-09-27T00:00:00Z | 36 | M | M | 0 | 0 | 377.10 | F | 38.0 | 5 | REACHING ABOVE SHOULDER LEVEL ACUTE MUSCLE STRAIN LEFT SIDE OF STOMACH | 2800 | 4014.002925 |
| 5 | WC6828422 | 1999-06-21T11:00:00Z | 1999-09-09T00:00:00Z | 50 | M | M | 0 | 0 | 200.00 | F | 38.0 | 5 | STRUCK HEAD ON HEAD LACERATED HEAD | 500 | 598.762315 |
| 6 | WC8058150 | 2001-07-13T11:00:00Z | 2001-07-23T00:00:00Z | 39 | M | M | 0 | 0 | 200.00 | F | 38.0 | 5 | FINGER BRUISED AND SWOLLEN LEFT ARM | 500 | 279.068178 |
| 7 | WC7539849 | 2000-03-09T09:00:00Z | 2000-04-16T00:00:00Z | 56 | M | M | 0 | 0 | 200.00 | F | 40.0 | 5 | CLEANING LEFT SHOULDER SPLINTER LEFT HAND | 500 | 1877.172243 |
| 8 | WC4427179 | 1994-03-24T16:00:00Z | 1994-04-26T00:00:00Z | 49 | M | M | 0 | 0 | 623.60 | F | 38.0 | 5 | JACK SLIPPED CATCHING FINGER CUT LEFT LITTLE FINGER | 925 | 1254.129811 |
| 9 | WC9907636 | 2005-12-07T11:00:00Z | 2005-12-22T00:00:00Z | 30 | M | S | 0 | 0 | 857.28 | F | 37.0 | 5 | STRUCK PINE DUST ABRASION LEFT EYE IRRITATION | 1500 | 1031.603044 |
| ClaimNumber | DateTimeOfAccident | DateReported | Age | Gender | MaritalStatus | DependentChildren | DependentsOther | WeeklyWages | PartTimeFullTime | HoursWorkedPerWeek | DaysWorkedPerWeek | ClaimDescription | InitialIncurredClaimsCost | UltimateIncurredClaimCost | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 53990 | WC3485403 | 1992-03-04T11:00:00Z | 1992-03-19T00:00:00Z | 32 | M | M | 0 | 0 | 384.03 | F | 38.0 | 5 | LIFTING DOOR INJURED SHOULDER AND LEFT FOREARM LEFT HIP | 1200 | 2633.844395 |
| 53991 | WC7426219 | 2000-08-28T15:00:00Z | 2000-11-16T00:00:00Z | 21 | M | S | 0 | 0 | 451.83 | F | 38.0 | 5 | SLICING VEGETABLES LACERATION RIGHT INDEX FINGER LACERATION | 1000 | 1248.103245 |
| 53992 | WC6263025 | 1997-04-18T11:00:00Z | 1997-04-28T00:00:00Z | 21 | M | S | 0 | 0 | 200.00 | F | 38.0 | 5 | STRUCK HAND WITH ALLEN KEY LACERATION LEFT HAND | 500 | 233.289431 |
| 53993 | WC4447156 | 1994-07-07T18:00:00Z | 1994-10-01T00:00:00Z | 47 | M | M | 0 | 0 | 532.00 | F | 38.0 | 5 | FELL FLOOR MAT STRAIN LOWER BACK AND NECK | 10000 | 8196.288506 |
| 53994 | WC7006507 | 1999-07-19T11:00:00Z | 2001-03-04T00:00:00Z | 35 | M | M | 0 | 0 | 200.00 | F | 40.0 | 5 | FELL STAIRS BRUISE RIGHT ANKLE AND RIGHT LEG | 15000 | 11847.081780 |
| 53995 | WC9370727 | 2004-08-21T18:00:00Z | 2004-09-08T00:00:00Z | 32 | F | S | 0 | 0 | 500.00 | F | 38.0 | 5 | STRUCK KNIFE LACERATED LEFT MIDDLE FINGER LEFT HAND | 1000 | 480.493308 |
| 53996 | WC8396269 | 2002-04-28T09:00:00Z | 2002-09-03T00:00:00Z | 20 | F | S | 0 | 0 | 500.00 | F | 40.0 | 5 | LEFT HAND LACERATION LEFT SIDE BACK AND LEFT LEG | 1000 | 755.735319 |
| 53997 | WC3609528 | 1992-02-28T09:00:00Z | 1992-03-18T00:00:00Z | 19 | M | S | 0 | 0 | 283.00 | F | 40.0 | 5 | METAL SLIPPED ACROSS METAL CUT FINGER | 210 | 418.178461 |
| 53998 | WC5038565 | 1995-01-10T07:00:00Z | 1995-01-31T00:00:00Z | 24 | M | S | 0 | 0 | 200.00 | F | 38.0 | 5 | BURN WHILST USING SPANNER LACERATION RIGHT MIDDLE FINGER | 7500 | 2695.225700 |
| 53999 | WC2542601 | 1990-10-24T14:00:00Z | 1990-11-03T00:00:00Z | 22 | M | S | 0 | 0 | 200.00 | F | 38.0 | 5 | CUT WITH BREAD KNIFE LACERATION LEFT INDEX AND MIDDLE FINGERS | 550 | 934.273548 |